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ABSTRACT 

With a view to provide a user-friendly interface for design¬ 
ing, training and developing deep learning frameworks, we 
have developed Expresso, a GUI tool written in Python. 
Expresso is built atop Caffe, the open-source, prize-winning 
framework popularly used to develop Convolutional Neu¬ 
ral Networks. Expresso provides a convenient wizard-like 
graphical interface which guides the user through various 
common scenarios - data import, construction and training 
of deep networks, performing various experiments, analyz¬ 
ing and visualizing the results of these experiments. The 
multi-threaded nature of Expresso enables concurrent execu¬ 
tion and notification of events related to the aforementioned 
scenarios. The GUI sub-components and inter-component 
interfaces in Expresso have been designed with extensibility 
in mind. We believe Expresso’s flexibility and ease of use 
will come in handy to researchers, newcomers and seasoned 
alike, in their explorations related to deep learning. 



Figure 1: Expresso’s main screen 


Categories and Subject Descriptors 

1.5.1 [Pattern Recognition]: Applications-Computer Vi¬ 
sion; D.2.2 [Software Engineering]: Design Tools and 
Techniques —software libraries; 1.5.1 [Pattern Recogni¬ 
tion]: Models - Neural Nets 

General Terms 

Design, Experimentation 
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1. INTRODUCTION 

In the recent years, deep learning has established itself 
as a disruptive framework, raising the state-of-the-art for 
challenging problems related to signal recognition and analy¬ 
sis [^. In particular. Convolutional Neural Networks (CNNs), 
a category of deep learning networks, have shown impressive 
performance on challenging image recognition datasets 
[^. The rapid adoption of CNNs has been made possible to 
a large extent by the availability of ready-to-use software 
for designing CNNs and related data processing. Of these 
softwares, the Caffe framework is one of the most pop¬ 
ular. The detailed nature of source and usage documenta¬ 
tion along with an active user forum has contributed im¬ 
mensely to Caffe’s popularity. The fact that Caffe was the 
prize-winning entry in the competitive ACMMM-2014 Open 
Source Software Competition further promotes its credibil¬ 
ity. 

The Caffe framework is primarily command-line based. To 
supplement Caffe’s functionality and expand its ease of use, 
particularly for newcomers, we have developed Expresso. 
Expresso is a GUI tool written in Python which uses Caffe as 
the back-end framework for CNNs. The project page (http: 
//val. sere. iisc. ernet. in/expresso) provides an overview 
of Expresso’s main features and information regarding in¬ 
stallation and usage. 

























Figure 2: Using the intelli-sense editor for creating 
and modifying layers of a deep net 


Figure 3: Viewing the layers of a pre-trained deep 
net. The net’s layers are color-coded by type. 


2. SALIENT FEATURES OF EXPRESSO 


• Expresso provides a convenient wizard-like interface 
to contextually guide the user during common scenar¬ 
ios such as data import, design and training of CNNs, 
performing experiments and visualizing their results 
(Figure [^. The wizard interface avoids the cognitive 
load involved in switching between the GUI and read¬ 
ing documentation from another interface (e.g. PDF 
hie or browser window). 

• Expresso’s multi-threaded architecture enables asyn¬ 
chronous, concurrent execution of tasks in the afore¬ 
mentioned scenarios. 

• Important events that occur related to various tasks 
(task initialization, progress and termination) are con¬ 
veyed to users via a notihcation system similar to that 
present in social media user-interfaces (Figure]^. 

• The modular nature of GUI sub-components and inter¬ 
component interfaces in Expresso have been designed 
with ease of extensibility in mind. 


3. THE EXPRESSO FRAMEWORK 

Expresso operates primarily as four views - Data, Net, 
Train, Experiment. These “views” rehect the predomi¬ 
nant use-case scenarios involving GNNs. In Expresso’s GUI, 
each view and associated interface elements are rendered in 
a uniform color livery for ease of visual grouping (Figures 
W >- Next, we describe these views. 


3.1 Data view 

This view is meant for import of input data and typically 
forms the hrst step for most tasks. The import interface is 
quite hexible and enables data import in a variety of input 
format^- starting from the basic “all-images-in-the-folder” 
up to a sophisticated, high-capacity format such as HDF5. 
Auxiliary information typically associated with data such as 
class labels can also be concurrently imported. A summary 
of salient details related to the imported data are presented 
in a list-based view upon import. A rudimentary image 

^Gurrently, Text,LevelDB, .mat,HDF5,Folder formats are 
supported. 


browser is also provided which can be used for a quick pre¬ 
view of the imported data. In addition, the imported data 
can also be exported to other formats. Gurrently, .mat and 
HDF5 are supported. A common requirement involves split 
of data into training and testing sets. This functionality is 
also provided in the Data view. 

Another interesting feature is that the back-end frame¬ 
work related to Data view also features a parser. This 
parser can be utilized to quickly extend the import inter¬ 
face to load data in formats not provided yet in Data view. 

3.2 Net view 

The Net view is utilized for constructing deep neural net¬ 
work architectures (nets) - either from scratch or by modih- 
cation of existing nets. The net construction phase requires 
specihcation of a training ‘prototxt’ hie whose contents are 
interpreted by Gaffe as the specihcation for network archi¬ 
tecture. A related text hie, usually referred to as the ‘deploy’ 
prototxt hie is associated with the usage of the network for 
testing and other experimentation. Optionally, a solver hie 
is also present which contains various auxiliary parameter 
settings. The purpose of the Net view is to generate all 
the hies required to specify a network in Gaffe-compatible 
format. 

In Net view, two net construction options exist. 

3.2.1 Constructing the net from scratch 

Once the option is chosen to construct the net from scratch, 
the layers which are associated with data are added auto¬ 
matically. Network layers of various common types can be 
subsequently added below the data layers. The back-end 
framework automatically calculates the sizes of responses 
for the resulting layers. 

The settings for each layer can be edited using a con¬ 
textual smart-edit interface. The various properties of a 
layer are typically specihed in programming-language style 
‘scopes’. The smart-edit interface automatically provides 
layer description options depending on the position of the 
edit cursor within the layer description scope (see Figure 
[^. As each layer is completed, the net design so far can 
be viewed. The graphical nature of both this process and 
the smart-edit interface provides an easy-to-use approach for 
GNN net design. 

At the end of the process, the resulting net, typically 
called the ‘training net’ forms the basis for the related ‘de- 




































ploy net’ prototxt file and therefore, the latter is initialized 
as a slightly modified copy of the ‘training net’ prototxt 
file. All the aforementioned files, related to the newly con¬ 
structed net can be saved into a net configuration. This 
freshly created net configuration can be trained by follow¬ 
ing the steps detailed in Training view (Section |3.3| . 

3.2.2 Create modified version of an existing net 

Alternately, if the objective is to create a modified version 
of an existing net, a net needs to be chosen from the list of 
available net configurations. Making this choice pre-loads 
the associated prototxt files (train, deploy and optionally, 
the solver). Having done so, the subsequent modifications 
to the net can be done as described earlier. 

A helpful feature of the Net view is that the layers of a 
net are displayed in color-coded fashion for ease of visual¬ 
ization (see Figure]^. Another important feature is that 
the created nets are Caffe-compatible files. Therefore, the 
hies associated with a net conhguration (train prototxt hie, 
deploy prototxt hie etc.) can be utilized directly with Caffe, 
outside of Expresso as well. 

3.3 Training view 

The Training view is utilized to train the nets with user- 
specihed data. The work how proceeds as follows : 

1. Select a net for training : This choice is made from 
a list of previously created net conhgurations whose 
location information is cached. Alternately, unlisted 
nets can be added for faster look-up in future sessions. 

2. Set training parameters : If the net conhguration does 
not have a solver, one is dehned at this point. The 
solver is typically associated with a number of associ¬ 
ated parameters. These parameters are set next, with 
an option for modifying and saving them if the solver 
already exists (see Figure]^. 

3. Data association : The data to be used for training is 
chosen next. 

4. Choosing net vs external training : Two kinds of train¬ 
ing are commonplace with CNNs. The hrst kind in¬ 
volves training the net from scratch. The second kind 
involves extracting features obtained by passing train¬ 
ing data through a pre-trained net and tapping the 
output at pre-specihed layer(s) of the net - a task 
which can be done from the Experiment view. The 
extracted features are used to train an external clas- 
siher such as a Support Vector Machine (SVM) [^|^. 
If such a choice is made, the associated parameters 
(type of SVM kernel, related parameter settings) are 
specified. 

5. Commence training : Pressing the ‘Finish’ button in 
this step kick starts the net training process. A noti¬ 
fication panel provides information related to progress 
of training such as number of epochs completed, esti¬ 
mated time to completion and training error plots. 

At the end of training, a model file is created regardless 
of training choice (net-based or external-classifier-based). 

An important feature in the Train view is the ability to 
stop a particular training task. This is particularly useful if 
the training is not proceeding satisfactorily (e.g. too slowly 
or has an unacceptable loss function magnitude). 



Figure 4: Interface to upload and adjust training 
parameters in Train view. 


3.4 Experiment view 

The Experiment view can be used for three important 
tasks, each of which is described below. 

3.4.1 Feature extraction using pre-trained nets 

When data is provided to a pre-trained CNN, the corre¬ 
sponding output from CNN layers has been shown to act as 
informative and discriminative feature representation of 
the data. To facilitate feature extraction, the user is asked 
to first choose a pre-trained net and then choose the layer(s) 
of interest. Having made the choice, the user then selects the 
data to be provided to the net. The subsequent step of fea¬ 
ture extraction results in feature data which can be stored 
in a variety of data formats, such as the ones compatible 
with the popular libsvm software [^. 

3.4.2 Visualizing feature data 

Features from certain layers of CNNs tend to preserve 
and highlight various spatial aspects of input data [^. To 
help visualize such features, an option is provided wherein 
features extracted previously can be selected. Such visual¬ 
ization can lead to a better understanding of both the data 
and evolution of features at various layers of the net (see 

3.4.3 Testing pre-trained models 

To determine the performance of the CNN, previously un¬ 
seen data is processed by a trained CNN and the results are 
compared against ground truth. For this process, the net 
configuration is chosen first. Subsequently, the test data 
is selected. Once the testing concludes, various standard 
performance metrics such as accuracy (global and per-class) 
and the confusion matrix are displayed. As with Training 
view, testing using external classifiers (SVM) can also be 
performed on features extracted from CNN layers. 

4. ADDITIONAL NOTES 

Expresso is written in Python and has been developed on 
machines running Linux Ubuntu 14.04. The BSD-licensed 
source code for Expresso can be accessed on Github at https: 
//github. com/val-iisc/expresso Expresso’s project page 
(http://val.serc.iisc.ernet.in/expresso) provides in¬ 
structions and information regarding installation and usage. 
An illustrated set of tutorials have also been provided with 
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Figure 5: Visualizing layer features of a deep net¬ 
work 


Expresso in text and narrated video formats. These tutori¬ 
als, covering typical use scenarios, assist the user in gaining 
familiarity with Expresso’s functionality. 

We also note that since Expresso has been built atop 
Caffe, future improvements and extensions to the latter are 
readily conferred to Expresso. 

5. CONCLUSION 

In this paper, we have presented a brief overview of the 
first version of Expresso - a GUI tool for designing, training 
and using Convolutional Neural Networks. In the spirit of 
open-source movement, we hope that users will not only use 
the tool, but also extend it, thereby making Expresso the 
go-to GUI based tool for exploring deep learning. 
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